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Abstract 

The issue of a "mean shape" of a random set X of- 
ten arises, in particular in image analysis and pattern 
detection. There is no canonical definition but one 
possible approach is the so-called Vorob'ev expectation 
Ey(X), which is closely linked to quantile sets. In this 
paper, we propose a consistent and ready to use estima- 
tor of Ey (X) built from independent copies of X with 
spatial discretization. The control of discretization er- 
rors is handled with a mild regularity assumption on 
the boundary of X: a not too large 'box counting' 
dimension. Some examples are developed and an ap- 
plication to cosmological data is presented, 
keywords: Stochastic geometry ; Random closed sets 
; Level sets ; Vorob'ev expectation 
AMS codes: Primary 60D05 ; Secondary 60F15 ; 
28A80 

1 Introduction and background 

The present paper proposes a ready to use estimator 
based on level sets, and that allows to tackle the ques- 
tion of approximating the mean shape or expectation of 
a random set. The practical motivation of this work is 
given by pattern recognition applications coming from 
domains like astronomy, epidemiology and image pro- 
cessing [22, 23, 20, 19]. Several ways to define the ex- 
pectation of a random set have been developed in the 
literature (see e.g. [ ]). Our investigation leads us to 
the Vorob'ev expectation, closely related to quantiles 
and level-sets. 

Let (Q^A^'P) be a probability space. Consider a ran- 
dom compact set X in [0, 1]*^ as a map from Q to the 
class C of compact sets in [0, 1]^ measurable in the fol- 
lowing sense (see e.g. [14]): 

VCgC, {uj:X{uj)nC^il}}eA. 



Robbins' formula, a straightforward consequence of 
Fubini's theorem, states that 

EA(X) = / F{xe X)\{dx), 

J[0,1]^ 

where A is the Lebesgue measure. To define a "mean 
shape" , it is thus natural to consider the coverage func- 
tion p{x) = ¥{x G X). For a G [0, 1], the (determinis- 
tic) a-level set of p{x) is: 

Qa = {xe [0,lf:p{x) > a} 

or {p > a} for short. Choosing a such that the volume 
of Qa matches the mean volume of X provides the 
Vorob'ev expectation Ey(X) (see e.g. [5, 14, 15, 25]). 

Definition 1.1. Set F{a) = X{Qa) = ^{p > ct'i^d 

a* = inf{a G [0, 1] : F{a) < EA(X)}. 

The Vorob^ev expectation of X is a Borel set Ey(X) 
such that A(Ey (X)) EA(X) and 



{p > a*} C Ey(X) C{p> a*}. 



(1) 
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Uniqueness of Ey(X) is ensured if F is continuous at 
a*, and in this case one can choose Ey(X) = {p > a*} 
which is compact since p{x) is upper semi-continuous. 
It can be shown ([ , Theorem 2.3. p. 177]) that 
Ey(X) minimizes B EA(5 A X) under the con- 
straint X{B) = EA(X), where B A X is the symmetric 
difference of B and X. 

Despite their very natural definitions, neither the 
QaS nor Ey(X) are tractable for applications. First, 
the coverage probability p{x) is not always available 
in an analytical closed form. Second, level sets can 
not be computed for all the points x G [0, 1]^ and 
discretization should be considered. 
The first point has been tackled by a large literature: 
plug-in estimators obtained by replacing p{x) with 
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an empirical counterpart have been considered by 
Molchanov [10, 11, 13], Cuevas et al [2, 3]. In 
particular, the latter establishes consistency of plug-in 
estimators for the L^-norm under weak assumptions. 
In particular, the function p{x) needs not being 
continuous. 

The aim of this paper is to derive a consistent and 
implementable estimator i^n,r, based on n i.i.d. copies 
Xi , . . . , Xn of X and spatial discretizations with a grid 
of mesh r. We extend proofs of consistency of plug- 
in estimators to estimators including a grid discretiza- 
tion, and generalize existing works on level-sets for the 
Vorob'ev expectation, when the level a is replaced by 
a level a* that depends on X. We prove strong con- 
sistency for the symmetric difference and provide also 
convergence rates. 



the coverage function p{x) should not be required. As 
a result, the function F is cadlag with possible plateaus 
(constant regions). Note that F is the survival func- 
tion of the random variable p{U) with U uniformly 
distributed on [0,1]^. 
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Figure 1: Behavior of function F{a) = X{Qa) 



2 Estimation of level sets 

2.1 Plug-in estimation of level sets 

To define the plug- in estimators for sets Qa, let us 
consider the empirical counterparts of p{x): 

1 

i=l 

Then, the plug-in estimator is 

Qn,a = {Pn > Oi} - 

In the literature, several distances can be used 
for closed sets. Here, we are interested in the 
pseudo-distance defined for any A^B G S([0,1]^) 
by d{A,B) = X{AAB). Other possibilities are the 
Hausdorff distance, for which similar consistency 
results can be obtained, under additional assumptions 
on the regularity of p{x) (see e.g. [ ] and [ , Th. 
1 and 2]). In [lO, 11] for instance, p{x) is assumed 
continuous. The case where p{x) is a density function 
has also been much investigated. Kernel estimators 
may be used in the plug-in estimation. Rates of 
convergence [13, 16, 26] or asymptotic normality [ ] 
are considered with regularity assumptions on p{x). 
Adaptive estimation can also be performed [^8]. 

-consistency is of practical interest and we follow 
the approach developed for instance in [ ] , and general- 
ize their results with grid approximations in view of nu- 
merical implementation. For applications, we also aim 
at weak assumptions, in particular, the continuity of 



As an example. Figure 1 shows how the function F 
may look like. It can be observed that the plateaus of 
p{x) make the discontinuities of F while the disconti- 
nuities of p{x) provide the plateaus of F. The jump of 
F at ao corresponds to the Lebesgue measure of the 
points with coverage probability equal to ao 

F_(ao) - F{ao) X{p ao}. 

The plateau [ai,a2] of F means that the set of points 
with coverage probability between ai and a2 is A- 
negligible and X{p > ai} = X{p > a} for all a G 
[ai,a2]. 

2.2 Grid approximation of a level set 

The plug-in estimators Qn,a are not satisfactory in 
view of applications, because they require computa- 
tion of Pn{x) for all X. In all what follows, r is a mesh 
introduced to work in [0, l]^nrZ^ rather than in [0, 1]^. 
We seek for a candidate ^ close to Qn,a for small 
r, but less greedy in computations. 

Definition 2.1. For any Borel set B in [0, 1]*^ and 
r G 2~^^ we call '^grid approximation of B" the set 

5^= y [x,x + r)^. 
xeBnrZd 

As expected, some regularity of the border dB has 
to be introduced (see [ , p.38-39]): 

Definition 2.2. Let 

Nr{dB) = Card{x G rZ"^ : [x, x + r)"^ DdB^ 0}. 



The ^'upper box counting dimension^' of dB is 

log Nr{dB) 



dimbox(S^) = limsup ■ 



logr 



A straightforward consequence of these definitions 



is: 



Proposition 2.3. Assume that dimbox(S^) < d. For 
all e > 0, there exists r^ such that 



The following result extends the result of Cuevas et 
al. [ , Th. 3] by adding a discretization scheme: 



Proposition 2.4. Assume that dimbox(9-^) < d — k 
with probability one for some n > {). For all a such 
that \{p = a} = 0; 

(i) with probability 1, 



lim d(Q;,„,Q,) = 0. 

n^oo 



(a) for all £ > 0; 

Ed(Q;^,,Q,) < r"+2e-'"^'+F(a-£)-F(a+£). 

3 Estimation of Ey(X): 
Kovyazin's mean 

Let us consider the volume Fn{oi) = \{Pn > <^}- Fol- 
lowing the approach of [ ], we introduce first the em- 
pirical volume 



1 

An = - VA(X0, 
n ^ — ^ 



and next 



< = inf{aG[0,l]:Fn(a)<An}. 

Definition 3.1. The '^Kovyazin^s mean'' is a Borel set 
Kn defined by an empirical version of (1); 

\{Kn) = An and {pn > a* } C C {pn > a* }. 

The following revisits the approach of Kovyazin [5]: 

Theorem 3.2. Assume that X{p = a*} = 0. Then, 
with probability one, 



3.1 A grid approximation of 

Let us consider the grid [0, l]^nr7j^. A first idea could 
be to take the grid approximation of B = X or B = 
Kn , but this would provide a set B^ with mean volume 
not necessarily equal to A^. Instead, we set 

al^, = mf{a e [0, 1] : A {{pn > aY) < An} (2) 

and consider a Borel set i^n,r of volume A^ such that 

{Pn > Oi^^rV ^ ^n,r C {Pn > Oi^^rV - (3) 

The set Kn,r is the implementable estimator that we 
propose. Its definition amounts in the following pro- 
cedure. First, we approximate the expected volume of 
the random set X by A^, which gives the number of 
cells to select. The latter are chosen according to their 
estimated coverage, which is given by Pn{x) for the cell 
[x^x + r)^ with X G rZ n [0, 1]^. The consistency and 
convergence rate are provided by the following results. 

3.2 Consistency of K^^r 

Recall the definition of a* in Definition 1.1 and set 

/3* = sup{a G [0, 1] : F{a) > EA(X)}. (4) 
Our main theorem states that: 
Theorem 3.3. Assume that 



(i) P(dimbox(9-^) < d — k) = 1 for some n> {). 

(a) The Lebesgue measures of {p = a*} and {p = 
are zero. 



With probability one, 

d(i^n,r,Ey(X)) 



n^oo 



0. 



For the proof, we write that 

and use Theorem 3.2 and the two following lemmas to 
conclude. 



Lemma 3.4. Assume that ¥{dim]jox{dX) < d — n) = 1 
for some > 0. Then, with probability one 



a* < liminfa* ^ < limsup a* ^ < 6* , 

n^oo ''''' • ''''' 



n^oo 
>0 



and as a sort of particular case, we also have 
a* < lim inf a* < lim sup a* < /3* . 



Lemma 3.5. Assume P(dimbox(S-^) < d — tz) = 1 for 

some > 0. With probability one. 



hmsup d{Kn,r,Kn) < 2 lim^F(a) - 

r^O a<a* 



4 Some examples 

4.1 The region covered by a boolean 
model 

As an example of multi-dimensional random set, let 
us consider the Boolean model [ \ 12, 24], constructed 
as follows. First, take a Poisson point process in 
R'^ ; the parameter /i is a locally finite measure on 
called intensity measure. Next consider a sequence of 
i.i.d compact sets {'^x)xeM'^^ which is independent of 
. Finally, replace each point x of 11^ by the shifted 
corresponding set x -{-E^. The resulting union set 

is the Boolean model. The points x are called germs 
and the random set So is the 'typical' grain of the 
model. The Boolean model is also called the Pois- 
son germ-grain model. To avoid trivial models where 

5 = almost surely, the d-th moment of the radius 
of the circumscribed circle of Eq must be finite. 

The distribution of the Boolean model is uniquely 
determined by its capacity functional (see [12, 24]) 

TH(C)=P(SnC7^0) = l-exp[-E/i(So©C)j , (5) 

where C ranges over compact sets in R*^, Sq is the 
symetric of typical grain with respect to the origin and 
the Minkowski addition. 

Assume that the Boolean model is observed through 
the window [0,1]^ so that we focus on the random com- 
pact set 

x = sn[o,i]^ 

that is the covered region in the cube by S. That X is 
indeed compact requires some (known) work. Briefiy, if 
E is viewed as a random counting measure on the class 
C of compact sets in R^, its intensity measure will be 
locally finite if and only if E/i(So C) < oo for every 
compact C. In these conditions, only finitely many sets 
X -\- Ex will hit the cube [0,1]^ almost surely (see , , 
sections 3 and 4]). 

4.1.1 Stationary case 

This type of Boolean model refers to a translation- 
invariant intensity measure fi on R^ so that /jj{dx) = 
mX{dx) for some m > 0. In the following, the consid- 
ered grains E^ are balls of random radius R such that 
E(i?^) < oo. The coverage probability p{x) of X is 
obtained from (5) by taking C = {x} with x G [0, 1]^: 

p{x) = Ts{{x}) = 1 - exp [-mA(5(0, l))IE(i?^)] . 



Set for short Cm,d = 1 - exp [-mA(B(0, l))E(i?^)] . 
Since p{x) is the constant Cm,d^ it's easy to conclude 
that 



F{a) 



^)(a) and a* = Cm^d- 



Thus, Qa* = while Qa*-s = [0, 1]"^ for any £ > 0. In 
this case, E^(X) is not unique and any measurable set 
with volume EA(X) is a possible Vorob'ev expectation. 
Also, the assumption X{p = a*} = does not hold here 
and neither does Theorem 3.2. 

4.1.2 Non-stationary case 

This type of Boolean model may be obtained for locally 
finite intensity measures fi{dx) = m{x)X{dx) with a 
non-constant positive function m{x). We assume that 
the radius probability distribution P o admits a 
continous density g with respect to the Lebesgue mea- 
sure such that 



Vx e [0,1]^ 



[0,1] 



g{\x-y\)^^f,{dy)^0. (6) 

I \x-y\ 



The coverage probability of X requires the computa- 
tion of the function 

^(x) = E/i(So0{x}) 

= E/i(x + 5(0,i?)) 

F{R> \x-y\)fi{dy). 

[0,1]^ 

For a G (0,1), the level set consists of points x 
satisfying: 

p{x) > a O exp ( — (j){x)) < 1 — a 

^ (j){x) > In (-^ — y 

\1 — a/ 

Computing the gradient \/x^{R > \x — y\) for x ^ y 
gives the integrand of (6). Since g is continuous, <p is of 
class with the non- vanishing derivative (6). More- 
over, the boundary dQa coincide with the set {p = a} 
and is obtained as the solution of 



(j){x) — In 



Using the implicit function theorem, the boundary of 
Qa is a C^-manifold of dimension d — 1: it can be lo- 
cally parameterized by -functions of d — 1 variables. 
Assumption X{p = a} = is satisfied. In this case. 
Proposition 2.4 holds for any a G (0, 1) and Theo- 
rems 3.2 and 3.3 apply, which yields consistency and a 
rate of convergence for the estimator i^n,r- 



4.1.3 Boolean model with atoms 

In this section, we provide an example where the cov- 
erage function exhibits discontinuities. Consider the 
model S of Section 4.1.2 with in addition an inde- 
pendent random vector {Ui^ . . . ^Un) of independent 
Bernoulli random variables. Fix points 7/1, ... ,7/ at in 
[0, 1]"^ and ro > and set 

T = SU y B{y,,ro). 

i:Ui = l 

We have thus added a random union of deterministic 
balls to the model S. In the case where some of the 
P([/i = 1) are equal to 1, this model is reminiscent of 
the conditional Boolean model (see Lantuejoul [ ]). 
The coverage probability of X = T fi [0, l]'^ is: 

= 1 - e-^(^) Y[ F{Ui = 0). 

i:B{yi,ro)3x 

and p{x) > a is equivalent to 

^(^)- lnP(/7i=0)>ln(^^). (7) 

i:B{yi,ro)3x 

Set V = U^idB{yi^ro). If we work on each connected 
component of [0, 1]^ \ P, the sum in the l.h.s. of (7) is 
constant and we are reduced to the case considered in 
Section 4.1.2. Besides, p{x) is discontinuous on the set 
V. Thus F{a) exhibits plateaus. If p{x) is continuous 
at a*, then the Vorob'ev expectation is unique and 
consistency and rate of convergence can be obtained 
for Kn^r- Else p{x) is discontinuous at a* and there is 
no uniqueness of the Vorob'ev expectation. 

4.2 Example on real data 

The filamentary network is one of the most impor- 
tant features in studying galaxy distribution [ ]. The 
point field in Figure 2 is given by the galaxy centers of 
the NGP150 sample obtained from the 2 degree Field 
Galaxy Redshift Survey (2dFGRS) [ ]• It can be easily 
noticed that the galaxy positions are not spread uni- 
formly in the observation window and the filamentary 
structure is observable simply by eye investigation. 

The automatic delineation of these structure is an 
important challenge for the cosmological community. 
The authors in [22, 23] models these filamentary struc- 
tures by a cloud X of interacting cylinders C{x,r]) 
which are encoded by their centers x G K compact sub- 
set of and their directions 7^ G the unit sphere 
with positive z-ordinate. To obtain such an object. 




Figure 2: NGP150 sample from the 2dFGRS survey. 

these authors use a point process F in x which 
has density 

a[0) 

with respect to the Poisson point process 11^ on 
R^ X with the Lebesgue measure as intensity /i. 
The energy function of the model is given by the sum 
of two terms that depend on parameters ^ G a 
compact subset of R^. The term UdijlO) depends 
on the observed galaxy fields and is called the data 
energy. It expresses that the point process giving the 
centers x is spatially inhomogeneous: more points will 
be drawn in region where observations are dense. The 
term Ui{'y\6) is called the interaction energy and it is 
related to the connection and the alignement of the 
cylinders forming the pattern. 

The random set that is considered is hence: 

x = x(r) = ( U c{x,7j))nK. 

{x,r])er 

For full complete details concerning the model and the 
simulation dynamics, the reader should refer to in [22, 
23]. To fit our previous developped framework, the 
compact K should be chosen as (or imbedded in) [0, 1]^ 
by using homothetical scaling of the data. But this is 
completely inessential. 

The model (8) behaves as follows. If only the 
interaction energy is used, the model simulates a 



connected network which is independent of the data. 
If only the data term is used the model locates the 
filamentary regions but the cylinders are clustering in 
a disordered way and they neither connect nor form 
a network. The data energy acts as a non-stationary 
Poisson marked point process. The interaction 
energy acts as distance based pair-wise interaction 
marked point process and it plays the role of a reg- 
ularization term or a prior for the filamentary network. 

Heuristically, the idea in [22, 23] is to estimate the 
parameter 0^ which is unknown, and simulate realiza- 
tions of X from which information on the distribution 
of X can be obtained. For instance, if Xi,...,Xn 
are realizations of X, their Vorob'ev mean can help 
us identify the patterns of the filamentary network by 
providing a mean shape appearing under the distribu- 
tion of X. Tunning a model as (8) is not always a very 
simple task. Stoica et al. use a Bayesian frameworks 
and introduce a prior q{0) for the model parameters. 
Using a Metropolis-Hastings algorithm, throwing the 
burn-off period away and choosing simulations that are 
sufficiently distant, they generate a non-correlated se- 
quence (^i, X^)^^!! that should be identically dis- 
tributed if the algorithm converged correctly. The 
Metropolis-Hastings algorithm uses simulated anneal- 
ing and depends on a temperature parameter T, and 
several strategies can be applied. 

Once this is done, notice that the coverage function 
writes as: 

p{x) = E (^^ i^,^xin,)}h{n^\e)Q{e)de^ . (9) 

The Lebesgue measure of the level sets induced by (9) 
is given by the function 

F{a) = / l|p(^)>c,}A(dx). 

JK 

Since, p{x) is not available in closed analytical form it 
is not a trivial task to check the continuity of F{a) to 
check if we are in the assumptions of Theorem 3.3. 

4.2.1 Fixed temperature 

If the temperature remains constant during the 
Metropolis-Hastings algorithm, then the algorithm 
is expected to produce a sequence of non-correlated 
random variables (^^,r^) with density h{^\0)Q{9). 
The sequence (Xi)^^|i ... is then considered to be 
identically distributed. 



The first empirical test consists in running the sim- 
ulation dynamics at fixed temperature T = 1, and we 
simulate a sequence of n = 1000 realizations of X. The 
Monte Carlo counter-part of F{a) is shown in Figure 3. 
No observable discontinuity can be detected by simple 
visual inspection. Hence, using the same samples we 
obtain the empirical values = 1158 and a* = 0.36. 

1 4000 1 \ \ \ \ \ \ \ \ \ 

12000 - 
10000 - 
8000 



6000 




Figure 3: Monte-Carlo computation of the function 
F{a) for the galaxy catalogue NGP150 using a sim- 
ulation dynamics at fixed temperature. 

The estimator of the Vorob'ev expectation is shown 
in Figure 4. Even if running the simulation dynamics 
at fixed temperature does not provide an estimator of 
the filamentary pattern, the estimator of the Vorob'ev 
expectation can be used to assess the presence of a 
filamentary pattern in the data under the assumption 
that the model parameters are correct. The empirical 
computation of the sufficient statistics of the model can 
be also used in order to validate the presence of the 
filamentary pattern and also to give a more detailed 
morphological description [zu, zz, zo]. 

4.2.2 Decreasing temperature 

We now start with a low temperature Tq and choose 
the cooling schedule along the Metropolis-Hastings al- 
gorithm as 

Tn = r^^- (10) 

log n + 1 
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Figure 4: Estimator of the Vorob'ev expectation for the 
galaxy catalogue NGP150 using a simulation dynamics 
at fixed temperature. 

Under these circumstances, the sequence 
(^i, r^)^^|i ... can be considered as non-correlated 
random variables that are uniformly distributed on 
the set maximizing the joint probability density or 
minimizing the total energy of the system (see [2 ]) 

(^,7) = arg max (7 1 6>)^(l9) 

. f Udh\o) + Ui{i\o) , 
= ^'^E^l\ W) 

where ft denotes the set of simple point measures on 
X S^. Clearly, the solution we obtain is not unique. 

The second test was to obtain n = 1000 samples us- 
ing the simulated annealing algorithm with Tq = 1. 
The approximated F{a) function is shown in Fig- 
ure 5. Comparing with the previous case, the func- 
tion F{a) looks less smooth. Two other approxima- 
tions are done. First, the simulated annealing sim- 
ulates a non-homogeneous Markov chain. Second, the 
cooling schedule fixes the sequence of distributions con- 
verging towards the uniform distribution on the con- 
figuration sub-space maximizing h{j\0)g{0). Again us- 
ing the same samples we obtain the empirical values 
An = 1730.6 and < = 0.64. 

The estimator of the Vorob'ev expectation computed 
using the simulated annealing algorithm is shown in 
Figure 6. Comparing with the first experience, the 



Figure 5: Computation of the function F{a) for the 
galaxy catalogue NGP150 using a simulated annealing 
algorithm. 



filamentary pattern has a bigger volume. As a conse- 
quence, this method is more useful for detecting aver- 
age patterns of the filaments, while one would prefer 
the method with fixed temperature for estimation. 



The presented apphcations lead towards new ques- 
tions concerning the level sets estimators. The first 
question consists in determining a class of models that 
allow computation of level sets and Vorob'ev expecta- 
tion in the setting of this paper. A very intuitive an- 
swer leads to the following recommendations of using 
continuous intensity functions and interaction poten- 
tials. Clearly, this may be a quite restrictive condition. 
Hence, the second question would be whether contin- 
uous priors on the model parameter guarantee an ap- 
propriate behavior of the F{a) function. And finally a 
third question is whether there exists an optimal cool- 
ing schedule guaranteeing the working hypotheses for 
the construction of level sets based estimators. From 
our experiments, it seems for instance that constant 
temperature is good for estimating parameters while 
simulated annealing is better to detect patterns. 




Figure 6: Estimator of the Vorob'ev expectation for the 
galaxy catalogue NGP150 using a simulated annealing 
algorithm. 

A Appendix: Proofs 

Several times, the Strong Law of Large numbers in the 
separable Banach space ^^([0,1]"^) (L^ SLLN) wih be 
used (see e.g. [7, chap. 7]). 

Proof of Proposition 2.3 

It is not difficult to see that B A is included in the 
union of cells + r)^ (x G rZ"^ H [0, l]'^) that meet 
dB. As a result, we have 

X{B AB"^) < Nr{dB)r^. 

Besides, given £ > 0, we get from definition (2.2) that 
for r small enough, 

Nr{dB) < ^-dimbox(aB)-^^ 

These two inequalities provide the result. 
Proof of Lemma 3.4 

Consider a < a* such that F is continuous at a. We 
show that 

pfa < liminf a! ^ = 1. 

Otherwise, on a set of positive probability, we have a > 
liminfn-^oo a* ,^ and there exists a sequence {nk^rk)k>i 



with n/e — > oo and r/c ^ such that a > a* ^ . Then, 
from the definition (3), 

> A({p„, >ap) 

> A ({p„, > a}) - d ({p„, > aY" , {pn, > a}) 

> ^ {{Pnt > a}) - sup d {{pn > aY'' , {p„ > a}) . 

n 

Note that d{pn > a} C UlLi since Pn is lo- 
cally constant on the complementary of [Ji^idXi. 
Moreover, since the "upper box-counting dimension" 
dimbox(-) has monotonic and stability properties (see 
[4]), we get whatever a, for the constant introduced 
in the assumptions. 



dimbox(9{Pn > Oi}) < dimboxf M dXi 



= max dimbox ( dXi ] < d — n. 

l<i<n \ J 

It follows then from Proposition 2.3 that for k large 
enough, 

supd({pn > o^Y\{Pn > a}) < r^/', (12) 

n 

and thus 

An, > X{Pnk > (A-'^k^- 

Taking the limit as /c ^ oo and invoking the L^-SLLN 
for (pn)n>i 7 we obtain 

EA(X) > \{p > a} = F(a). (13) 

This contradicts the definition of a*. As a consequence, 
we have 

Pfa* < liminfa* ) = 1. 

Very similarly, we can prove that 

Pf limsup<^ < = 1. 

The particular case without the mesh r is in the same 
spirit: we can show that there exists a sequence Uk ^ 
oo such that A^, > ^{{Puk > <^}) which leads to (13). 



Proof of Lemma 3.5 

From d{A,B) = X{A) - X{B) + 2X{B \ A), and since 



-d(Kn,r,i^n) < A > \ {Pn > <}) 



a* = It's enough to prove that, with probabihty 
one, 



Pn - (^n 



^ p — a . 



(15) 



Indeed together with the assumption X{p = a*} 
this yields 



< A {{pn > <,,}^ \ {pn > <, J) 

+ A {{pn > <, J \ {Pn > <}) MPn-< < 0,p > a*} ^ A{p-a* < 0,p > a*} = 0, 

< d {{pn > <, J^ {Pn > <,r}) ^ud thus TUn teuds to as n ^ oo. But (15) stems 

+ A{a* ^ < Pn < Q^n}- from Lemma 3.4 that gives a* a* with proba- 

L^([0,1]^) . . , T T .,. 

The first term in the majoration is with probability one bility one and Pn > p also with probability 

less than r'^/^ for r small enough, as in (12). For the o^^- 
second one, let a < a* and /3 > /3* ; by Lemma 3.4, 

there exists n^,,^ > 1 and r«,^ > such that for all < /^*- We distinguish three subcases. If a* < a* 



n > na,(3 and r G (0,ra,^), 

A{<,r <Pn< <} < A{a < < /^}, 

and again by the L^-SLLN for (pn)n>i, 
A{a < < /3} > X{a<p<f3} = F{a) - F{(3), 

n^oo 

provided F is continuous at a and /3. We deduce that 
with probability one 



then 

rrin < X{pn < al,p> a*} 

< A{pn <<,P>r} + A{a* <p</3*} 
= A{pn<<,P>/3*} 

+ A{p = a*} + (F(a*)-F(/3*-)). 



The two last terms are zero, thus: 



limsupd(if„,„if„) < 2 [F{a) - F{l3)] , 

n^oo 

and the proof is easily completed. 
Proof of Theorem 3.2 

Consider Borel sets A, 5, G [0,1] and coverage 

functions p, q such that 

{p > a} C A C {p > a} and {q > fi} C B C {q > 
From d(A, B) = X{A) - X{B) + 2X{B \ A), we deduce 

d{AB)<\X{A)-X{B)\ 

+ 2 [A{p > a, g < /3} A A{p < a, g > , 

and for A = Kn and B = Ey(X), we get 

d(Kn,Ey(X)) < |An - A(Ey(X))| + 2mn (14) 

with 

rrin = A{pn > a^,p < a*} A A{pn < a* ,p > a*}. 



<A{|pn-p|>r-c^*}< 



\\Pn-p\\ 



/3* - a* 



by using the Markov inequality. If a* > then 
starting from rrin ^ ^{Pn > <^n'-P — ^*} obtain 
the same upper-bound. Finally, when a* < a* < 

(/3*-a*)mn < (/3*-OA{pn<<,p>r} 

+ « - a*)A{pn > <,p < a*} 

< IIpu-pIIli. 

In the three cases, rrin is bounded by ||pn — 
p\\l^/ (/^* ~ <^*) which converges to zero a.s. 



Proof of Proposition 2.4 

By triangular inequality and Proposition 2.3, 

Ed(Q;,,,Q«) <r^^Ed{Qn,a.Qa)^ 

It remains to control the last term Ed((5n,Q;, Qa)- We 
have 



The first term in (14) converges a.s. towards by . . . . 

the SLLN. The second one requires to distinguish two ^(Qri,a^ Qa) = X{Pn > a, p < a\ ^ X{pn <a,p>a\ 

< X{xe [0,1]^: \pn{x)-p{x)\ > \p{x)-a\}. 



cases: 
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Taking expectation and using Fubini's theorem and 
Bernstein's inequality, one gets for all e > 



< 



< 



J[0,1] 

J{W-v\ 



n\Pn{x)-p{x)\>\p{x) 



2e 



-2n\oi-p{x)Y 



\{dx) 



J{\a-p\<e} i^^J 



HI 



-e). 
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